# Multi-dialect Support
Xiyansql QwenCoder 3B 2504
Apache-2.0
XiYanSQL-QwenCoder-3B-2504 is the latest SQL generation model released by XGenerationLab. Optimized through fine-tuning and GRPO training, it supports multiple dialects and delivers efficient and accurate SQL generation capabilities.
Large Language Model
Safetensors Supports Multiple Languages
X
XGenerationLab
182
3
Xiyansql QwenCoder 7B 2504
Apache-2.0
A fine-tuned SQL generation model based on QwenCoder, supporting multiple dialects with excellent performance
Text Generation
Safetensors Supports Multiple Languages
X
XGenerationLab
266
2
Mms 300m Arabic Dialect Identifier
This model is fine-tuned from MMS-300m for Arabic dialect speech recognition, capable of identifying Modern Standard Arabic and four major Arabic dialects.
Audio Classification
Transformers Arabic

M
badrex
73
2
Whisper Small Tel
Apache-2.0
A speech recognition model fine-tuned on Telugu audio datasets based on OpenAI Whisper-large-v2
Speech Recognition
Transformers Other

W
sagarchapara
17
1
F5 TTS Arabic
A high-quality Arabic speech synthesis model fine-tuned based on F5-TTS, supporting diverse pronunciations and accents from different regions
Speech Synthesis Supports Multiple Languages
F
IbrahimSalah
104
11
Audiox South V1
Apache-2.0
AudioX is a multilingual automatic speech recognition model developed by Jivi AI, specifically optimized for South Indian languages, supporting Tamil, Telugu, Kannada, and Malayalam.
Speech Recognition Other
A
jiviai
148
1
Chat2db SQL 7B
Apache-2.0
A 7-billion-parameter model fine-tuned on CodeLlama, specifically designed for natural language to SQL tasks, supporting multiple SQL dialects and 16k context length processing
Large Language Model
Transformers Supports Multiple Languages

C
Chat2DB
382
51
Indic Whisper Hi Multi Gpu
MIT
IndicWhisper is a cutting-edge speech recognition model optimized for Indian languages, excelling in various benchmarks for Indian languages.
Speech Recognition Other
I
parthiv11
72
4
Whisper Base Arabic
Apache-2.0
An Arabic speech recognition model based on Whisper-base, fine-tuned on multiple Arabic datasets, specializing in Arabic speech-to-text tasks
Speech Recognition
Transformers Supports Multiple Languages

W
YazanSalameh
46
3
Arat5 Arabic Dialects Translation
Apache-2.0
This model is trained on Arabic dialect datasets for translating Arabic dialects into Modern Standard Arabic (MSA).
Machine Translation
Transformers Arabic

A
PRAli22
136
4
Speecht5 Finetuned Fleurs Zh
MIT
A Chinese text-to-speech model fine-tuned on the fleurs dataset based on microsoft/speecht5_tts
Speech Synthesis
Transformers

S
GCYY
117
1
Whisper Small Cv11 French
Apache-2.0
A French automatic speech recognition model fine-tuned based on openai/whisper-small, trained on the Common Voice 11.0 French dataset, supporting case sensitivity and punctuation prediction.
Speech Recognition
Transformers French

W
bofenghuang
266
4
Whisper Telugu Base
Apache-2.0
A Telugu automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper-base, trained on multiple public Telugu datasets
Speech Recognition Other
W
vasista22
279
10
Whisper Medium Ar
Apache-2.0
A speech recognition model fine-tuned on Arabic datasets based on openai/whisper-medium
Speech Recognition
Transformers

W
arbml
49
3
Whisper Large Sme
Apache-2.0
A Northern Sami speech recognition model fine-tuned on Whisper-large-v2, achieving a word error rate of 24.91% on the test set
Speech Recognition
Transformers Other

W
NbAiLab
40
5
Opus Mt Tc Big Ar En
This is a neural machine translation model for Arabic to English translation, part of the OPUS-MT project.
Machine Translation
Transformers Supports Multiple Languages

O
Helsinki-NLP
18.14k
18
Wav2vec2 Large Hindicone
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m, supporting Hindi.
Speech Recognition
Transformers

W
SAGAR4REAL
16
0
Wav2vec2 Xlsr Romansh Sursilvan
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-1b, achieving a word error rate (WER) of 13.82% on the Common Voice 8 test set.
Speech Recognition
Transformers

W
sammy786
18
0
Wav2vec2 Large Xls R 300m Ha Cv8
Apache-2.0
A Hausa speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

W
anuragshas
17
1
Beto Sentiment Analysis
A sentiment analysis model trained based on the BETO Spanish BERT model, supporting POS/NEG/NEU three-class sentiment classification
Text Classification Spanish
B
finiteautomata
339.11k
30
Bert Base Arabertv02 Twitter
A BERT model optimized for Arabic dialects and tweets, pre-trained on 60 million Arabic tweets with MLM tasks, with added support for emojis and common vocabulary.
Large Language Model
Transformers Arabic

B
aubmindlab
2,148
8
Bert Large Arabertv02 Twitter
AraBERTv0.2-Twitter is a pre-trained language model optimized for Arabic dialects and tweets, developed based on the BERT architecture, with added support for emojis and common vocabulary.
Large Language Model
Transformers Arabic

B
aubmindlab
312
4
Albert Large Arabic
Arabic pretrained version of ALBERT large model, trained on approximately 4.4 billion words of Arabic corpus
Large Language Model
Transformers Arabic

A
asafaya
45
1
Bp Cetuc100 Xlsr
Apache-2.0
Wav2vec2 model fine-tuned for Brazilian Portuguese using the CETUC dataset, trained with approximately 145 hours of Brazilian Portuguese speech data
Speech Recognition
Transformers Other

B
lgris
22
0
Wav2vec2 Large Xls R 300m Hindi
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on Hindi speech datasets based on Facebook's wav2vec2-xls-r-300m model
Speech Recognition
Transformers Other

W
infinitejoy
13
0
Wav2vec2 Xls R 300m Zh HK Lm V2
Apache-2.0
An automatic speech recognition model based on XLS-R architecture, optimized for Cantonese (zh-HK), fine-tuned on the Common Voice dataset and enhanced with a 5-gram language model.
Speech Recognition
Transformers

W
w11wo
25
0
Albert Xlarge Arabic
An Arabic version of the ALBERT Xlarge pretrained language model, trained on approximately 4.4 billion words, supporting Modern Standard Arabic and some dialectal content.
Large Language Model
Transformers Arabic

A
asafaya
64
1
XLS R Marathi
Apache-2.0
An automatic speech recognition model fine-tuned on Marathi datasets based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

X
StephennFernandes
34
0
Wav2vec2 Large Xls R 300m Galician
Apache-2.0
This is an automatic speech recognition model fine-tuned on Galician speech datasets based on facebook/wav2vec2-xls-r-300m.
Speech Recognition
Transformers Other

W
infinitejoy
31
0
Wav2vec2 Large Xlsr 53 Chinese Zh Cn Gpt
Apache-2.0
A Chinese (zh-CN) speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers Chinese

W
ydshieh
127
32
Bert Medium Arabic
Pre-trained Arabic BERT medium language model, trained on approximately 8.2 billion words of Arabic text resources
Large Language Model Arabic
B
asafaya
66
0
Bert Large Arabertv2
AraBERT is a pre-trained language model based on Google's BERT architecture, specifically designed for Arabic natural language understanding tasks.
Large Language Model Arabic
B
aubmindlab
334
11
Ara DialectBERT
A BERT model for Arabic dialects, further trained on the HARD-Arabic-Dataset based on CAMeL-Lab's bert-base-camelbert-msa-eighth model
Large Language Model Arabic
A
MutazYoune
22
0
Xls R 2b Nl V2 Lm 5gram Os2 Hunspell
A CTC model based on XLS-R with a 5-gram language model from Open Subtitles, primarily used for automatic speech recognition in Dutch and Flemish.
Speech Recognition
Transformers Other

X
FremyCompany
18
4
Bert Base Arabic Camelbert Mix Ner
Apache-2.0
An Arabic named entity recognition model fine-tuned based on CAMeLBERT Mix, supporting entity recognition in Modern Standard Arabic, dialects, and Classical Arabic
Sequence Labeling
Transformers Arabic

B
CAMeL-Lab
24.24k
13
XLSR 300M Nynorsk
Apache-2.0
A Nynorsk automatic speech recognition model based on the XLSR-300M architecture, trained on the NPSC dataset with low word error rate and character error rate.
Speech Recognition
Transformers

X
NbAiLab
22
0
Opus Mt Sv ZH
Apache-2.0
A Transformer-based machine translation model for Swedish to Chinese, supporting multiple Chinese variants, developed by the Helsinki-NLP team
Machine Translation
Transformers Supports Multiple Languages

O
Helsinki-NLP
13
0
Bert Base Arabic Camelbert Msa Did Nadi
Apache-2.0
A dialect identification model fine-tuned based on the CAMeLBERT Modern Standard Arabic model, supporting 21 Arabic dialect identifications.
Text Classification
Transformers Arabic

B
CAMeL-Lab
41
0
Opus Mt Zh De
Apache-2.0
This is a machine translation model from Chinese to German based on the transformer-align architecture, supporting translation from various Chinese dialect variants such as Mandarin and Cantonese to German.
Machine Translation
Transformers Supports Multiple Languages

O
Helsinki-NLP
472
0
Nb T5 Base V3
This is a Norwegian T5-based model trained on the Norwegian Colossal Corpus (NCC) using TPU v3-8.
Large Language Model Other
N
NbAiLab
21
0
- 1
- 2
Featured Recommended AI Models